FIG. 1A 

EcoRIa 2*4 39 $1, 

GAATTCC GG TGC AGG ACG AAG CTG TTC TGG ATT TCT TAC AGT GAT GGG GAC CAG 
Arg Thr Lys Leu Phe Trp Me Ser Tyr Ser Asp Gly Asp Gin 

69 84 99 

TGT GCC TCA AGT CCA TGC CAG AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC 
Cys Ala Ser Ser Pro Cys Gin Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser 

114 129 144 159 

TAT ATC TGC TTC TGC CTC CCT GCC TTC GAG GGC CGG AAC, TGT GAG ACG CAC AAG 
Tyr lie Cys Phe Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys 

17^ 189 204 Pst la 

GAT GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TA C TGC AGT 
Asp Asp Gin Leu lie Cys Val Asn Glu Asn G)y Gly Cys GVa Gin Tyr Cys Ser 

219 234 249 264 

GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG GGG TAC TCT CTG CTG 

Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu Gly Tyr Ser Leu Leu 

279 294 309 324 

GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT GAA TAT CCA TGT GGA AAA ATA CCT 
Ala Asp Gly Val Ser Cys Thr Pro Thr Val Glu Tyr Pro Cys Gly Lys lie Pro 

Xba I 339 354 369 

AT T CTA GAA AAA AGA AAT GCC AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG 
lie Leu Glu Lys Arg Asn Ala Ser Lys Pro Gin Gly Arg lie Val Gly Gly Lys 

384 399 414 429 

GTG TGC CCC AAA GGG GAG TGT CCA TGG CAG GTC CTG TTG TTG GTG AAT GGA GCT 
Val Cys Pro Lys Gly Glu Cys Pro Trp Gin Val Leu Leu Leu Val Asn Gly Ala 

444 459 474 

CAG TTG TGT GGG GGG ACC CTG ATC AAC ACC ATC TGG GTG GTC TCC GCG GCC CAC 
Gin Leu Cys Gly Gly Thr Leu Me Asn Thr Me Trp Val Val Ser Ala Ala His 

489 504 519 534 

TGT TTC GAC AAA ATC AAG AAC TGG AGG AAC CTG ATC GCG GTG CTG GGC GAG CAC 

Cys Phe Asp Lys Me Lys Asn Trp Arg Asn Leu Me Ala Val Leu Gly Glu His 

549 564 579 594 

GAC CTC AGC GAG CAC GAC GGG GAT GAG CAG AGC CGG CGG GTG GCG CAG GTC ATC 
Asp Leu Ser Glu His Asp Gly Asp Glu Gin Ser Arg Arg Val Ala Gin Val Me 

609 Sma I 624 639 

ATC CCC AGC ACG TAC GT C CCG GG C ACC ACC AAC CAC GAC ATC GCG CTG CTC CGC 
Me Pro Ser Thr Tyr Val Pro Gly Thr Thr Asn His Asp Me Ala Leu Leu Arg 



654 669 684 699 

CTG CAC CAG CCC GTG GTC CTC ACT GAC CAT GTG GTG CCC CTC TGC CTG CCC GAA 
Leu His Gin Pro Val Val Leu Thr Asp His Val Val Pro Leu Cys Leu Pro Glu 



714 729 744 

CGG ACG TTC TCT GAG AGG ACG CTG GCC TTC GTG CGC TTC TCA TTG GTC AGC GGC 
Arg Thr Phe Ser Glu Arg Thr Leu Ala Phe Val Arg Phe Scr Leu Val Ser Gly 

759 77* Nar I 789 80^ 

TOG GGC CAG CTG CTG GAC CGT CGC GCC ACG GCC CTG GAG CTC ATG GTC CTC AAC 

Trp Gly Gin Leu Leu Asp Arg Gly Ala Thr Ala Leu Glu leu Met Val -Leu Asn 

819 834 Pst lb 849 86V 

GTG CCC CGG CTG ATG ACC CAG GAC TGC CTG CAG CAG TCA CGG AAG GTG GGA GAC 
Val Pro Arg Leu Net Thr Gin Asp Cys Leu Gin Gin Ser Arg Lys Val Gly Asp 

879 894 909 

TCC CCA AAT ATC ACG GAG TAC ATG TTC TGT GCC GGC TAC TCG GAT GGC AGC AAG 
Ser Pro Asn lie Thr Glu Tyr Met Phe Cys Ala Gly Tyr Ser Asp Gly Ser Lys 

924 939 954 969 

GAC TCC TGC AAG GGG GAC ACT GGA GGC CCA CAT GCC ACC CAC TAC CGG GGC ACG 
Asp Ser Cys Lys Gly Asp Ser Gly Gly Pro His Ala Thr His Tyr Arg Gly Thr 

984 999 1014 

TGG TAC CTG ACG GGC ATC GTC AGC TGG GGC CAG GGC TGC GCA ACC GTG GGC CAC 
Trp Tyr Leu Thr Gly lie Val Ser Trp Gly Gin Gly Cys Ala Thr Val Gly His 

1029 1044 1059 TaqI 1074 

TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG CTG CAA AAG CTC ATG 
Phe Gly Val Tyr Thr Arg Val Ser Gin Tyr lie Glu Trp Leu Gin Lys Leu Met 

1089 1104 1119 1138 

CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG CGA GCC CCA TTT CCC TAG CCCAGCAGCC 
Arg Ser Glu Pro Arg Pro Gly Val Leu Leu Arg Ala Pro Phe Pro 

Pstlc 

1148 1158 1168 1178 1188 1198 1208 

CTGGCCTGTG GAGAGAAAGC CAAGGCTGCG TCGAACTGTC CTGGCACCAA ATC CC AT AT A TTCTTCTGCA 



1218 1228 1238 1248 1258 1268 1278 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGCAGCGAG ACAGAGACAG AAACAGAGAG 



1288 1298 1308 1318 1328 1338 1348 

AGACAGAGAC AGAGAGAGAC TGAGGGAGAG ACTCTGAGGA CCATGGACAG AGACTCAAAG AGACTCCAAG 



1358 1368 1378 1388 1398 1408 1418 

ATTCAAAGAG ACTAATAGAG ACACAGAGAT GGAATAGAAA AG ATG AG AGG CAGAGGCAGA CAGGCGCTGG 



1428 1438 1448 1458 1468 1478 1488 

ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG ACAGCCCAGC TGAGCCTCCT TACCTCCCTT 



1W 1508 1518 1528 1538 "15*18 1558 

CAGCCAAGCC CCACCTGCAC GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 



'568 1578 1588 1598 1608 1618 1628 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA CAGAGATATG CACACACACG 

1638 1648 1658 1668 1678 1688 I698 

GATGCACACA CAGATGGTCA CACAGAGTAC GCAAACACAC CGATGCACAC GCACATAGAG ATATGCACAC 

1708 1718 1728 1738 17*8 1758 1768 

ACAGATGCAC ACACAGATAT AC AC AT G GAG TGCACGCACA TGCCAATGCA CGCACACATC AGTGCACACG 

1778 1788 1798 1808 1818 1828 I838 

GATGCACACA GATATGCACA CACCGATGTG CGCACACACA GATATGCACA CACATGGATG AGCACACACA 

1848 1858 1868 1 878 1888 1898 1908 

CACCAAGTGC GCACACACAC CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 

1918 1928 1938 19*8 1958 1968 1978 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT CTTATCCATT ATCATCTTCA 

1988 1998 2008 2018 2028 2038 20^8 

CTTCAGACAA TTCAGAAGCA TCACCATGCA TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC 

2058 2068 2078 2088 2098 2108 2118 

TCCCTTCGCT GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT AAACGGCTGC 



2128 2138 2148 2158 2168 EcoRlb 

GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAGGAATTC 



FIG. IB 



m^*,v^,^„„ MetValSerGlnAlaLeuArgLeuLeu 
TCAACAGGCAGGGGCAGCACTGCAGAGATTTCATCATGGTCTCCCAGGCCCTCAGGCTCCTC 
10 20 30 40 50 60 

-50 _40 i 

m^^ LeuLeuGlyLeuGlnGlyCysLeuAlaAlaGl y G1 y ValA1 aLysAlaSerGlyGly 
TGCCTTCTGCTTGGGCTTCAGGGCTGCCTGGCTGCAGGCGGGGT CGCT AAGGCCTCAGG AGGA 
70 80 90 100 110 120 

"30 -20 1 

GluThrArgAspMetProTrpLysProGlyProHisArgValPheValThrGlnGluGlu 
G A A AC ACGG G AC AT G CCGT GG A AGC CGGG G C CT C AC AG AGT CTT CGT A ACC C AGG AGG A A 
130 140 150 160 170 180 

-10 -1 +1 +10 

AlaHisGlyValLeuHisArgArgArgArgAlaAsnAlaPheLeuGluGluLeuArqPro 
GCCCACGGCGTCCTGCACCGGCGCCGGCGCGCCAACGCGTTCCTGGAGGAGCTGCGGCCG 
190 200 210 220 230 240 

+2 ° +30 
GlySerLeuGluArgGluCysLysGluGluGinCysSerPheGluGluAlaArgGluIle 
GGCTCCCTGGAGAGGG AGTGCAAGGAGGAGCAGTGCTCCTTCGAGGAGGCCCGGGAGATC 
250 260 270 280 290 300 

+4 0 +50 
PheLysAspAlaGluArgThrLysLeuPheTrpIleSerTyrSerAspGlyAspGlnCys 
TTCAAGGACGCGGAGAGGACGAAGCTGTTCTGGATTTCTTACAGTGATGGGGACCAGTGT 
31° 320 330 340 350 360 

+60 +70 
A i^n?^S roCysGlnAsnGlyGlySerC y sL y sAs P GlnLeu GlnSerTyrIleCys 
GCCTCAAGTCCATGCCAGAATGGGGGCTCCTGCAAGGACCAGCTCCAGTCCTATATCTGC 
370 380 390 400 410 420 

+80 +90 
PheCysLeuProAlaPheGluGlyArgAsnCysGluThrHisLysAspAspGlnLeuIle 
TTCTGCCTCCCTGCCTTCGAGGGCCGG AACTGTGAGACGCACAAGGATGACCAGCTGATC 
430 440 450 460 470 480 

+100 + il0 
CysValAsnGluAsnGlyGlyCysGluGlnTyrCysSerAspHisThrGlyThrLysArq 
TGTGTGAACGAGAACGGCGGCTGTGAGCAGTACTGCAGTGACCACACGGGCACCAAGCGC 
490 500 510 520 530 540 

+120 + 130 
SerCys ArgCysHisGluGlyTyr SerLeuLeuAlaAspGlyValSerCysThrProThr 
TCCTGTCGGTGCCACGAGGGGTACTCTCTGCTGGCAGACGGGGTGTCCTGCACACCCACA 
550 560 570 580 590 600 

+140 + 150 
ValGluTyrProCysGlyLysIleProIleLeuGluLysArgAsnAlaSerLysProGln 
GTTGAATATCCATGTGGAAAAATACCTATTCTAGAAAAAAGAAATGCCAGCAAACCCCAA 
6X0 620 630 640 650 660 



+±t>0 +170 
Si^S5?^ eValGlyGlyLysValCysProL y sG1 yGluCysProTrpGlnValLeuLeu 
GGCCGAATTGTGGGGGGCAAGGTGTGCCCCAAAGGGGAGTGTCCATGGCAGOTCCTOTTG 
670 680 690 700 710 720 

+180 +190 
m^^i A f" GlyAlaGlnLeuC y sG1 y G1 yThrLeuIleAsnThrlleTr P ValValSer 
ttggtgaatggagctcagttgtgtggggggaccctgatcaacaccatctgIgtgg?ct2c 

730 740 750 760 770 780 

+200 +210 
A i aA i a " i ^ GysPheAs P L y sIle LYsAsnTrpArgAsnLeuIleAlaValLeuGlyGlu 

gcggcccactgtttcgacaaaatcaagaactggaggaacctgatcgcggtgc?gggcgag 

/yu a8 ° 810 820 830 840 

+220 +210 
2^ A ^S eUSerGluHisAs P G1 y As P GluGln SerArgArgValAlaGlnValTleIle 

cacgacctcagcgagcacgacggggatgagcagagccg^ 

850 8 60 870 880 890 900 

+240 +? c n 
™?^ rThrTyrValProG1 y ThrThr AsnHi3AspIleAlaLeuLeuArqLeuHisGln 

cccagcacgtacotcccgggcatc^ 

910 920 930 940 950 960 

+260 +270 

^™i^ lLeuThrAspHisValValProLeuC y s LeuProGluArgThrPheSerGlu 
CCCGTGGTCCTCACTGACCATGTGGTGCCCCT^ 

970 980 "0 1000 1010 1020 

+280 +290 

ArgThrLeuAlaPheValArgPheSerLeuValSerGlyTrpGlyGlnLeuLeuAsDAra 

?n A n GCTGGC ?H CGTGCGCTTCTCATTGGTCAGCGGGTGGG 

1030 1040 1050 1060 1070 1080 

+300 +310 
Si^ A ™ hrAlaLeuGluLeuMetValLeuAsnValPr oArgLeuMetThrGlnASDCvs 
GGCGCCACGGCCCTGGAGCTCATGGTCCTCAACGTGCCCCGGCTGATGA^CCAG^ 
1090 1100 1110 1120 1130 1140 

+320 +330 
LeuGlnGlnSerArgLysValGlyAspSerProAsnlleThrGluTyrMetPheCvsAla 
CTGCAGCAGTCACGGAAGGTGGGAGAOTCCCCAAATATCACGGAGTACA^GTT^ 
1150 1160 1170 1180 1190 1200 

+340 +350 
GlyTyrSerAspGlySerLysAspSerCysLysGlyAspSerGlyGlyProHisAlaThr 
G^ACTCGGATGGCAGCAAGGACTCCn^ 
1210 1220 1230 1240 1250 1260 

+360 +37n 
HisTyrArgGlyThrTrpTyrLeuThrGlylleValSerTrpGlyGlnGlyCvsAlaThr 
CACTACCGGGGCACGTGGTACCTGACGGGCATCGTCAGCTGGGG^CAGGGCTG^ 
1270 128 0 1290 1300 1310 1320 



+380 +390 
ValGlyHisPheGlyValTyrThrArgValSerGlnTyrlleGluTrpLeuGlnLysLeu 
GTGGGCCACTTTGGGGTGT ACACCAGGGTCTCCCAGT ACATCGAGTGGCTGCAAAAGCTC 
1330 1340 1350 1360 1370 1380 

+400 +406 
Met ArgSerGluProArgProGlyValLeuLeuArgAlaProPhePro*** 
ATGCGCTCAGAGCCACGCCCAGGAGTCCTCCTGCGAGCCCCATTTCCCTAGCCCAGCAGC 
1390 1400 1410 1420 1430 1440 

CCTGGCCTGTGGAGAGAAAGCCAAGGCTGCGTCGAACTGTCCTGGCACCAAATCCCATAT 
1450 1460 1470 1480 1490 1500 

ATTCTTCTGCAGTTAATGGGGTAGAGGAGGGCATGGGAGGGAGGGAGAGGTGGGGAGGGA 
1510 1520 1530 1540 1550 1560 

G AC AG AG AC AG A A AC AG AG AG AG AC AG AG AC AG AG AG AG ACT G AGGG AG AG ACT CTG AGG 
1570 1580 1590 1600 1610 1620 

ACATGGAGAGAGACTCAAAGAGACTCCAAGATTCAAAGAGACTAATAGAGACACAGAGAT 
1630 1640 1650 1660 1670 1680 

GGAATAGAAAAGATG AGAGGCAGAGGCAGACAGGCGCTGGACAGAGGGGCAGGGGAGTGC 
1690 1700 1710 1720 1730 1740 

CAAGGTTGTCCTGGAGGCAGACAGCCCAGCTGAGCCTCCTTACCTCCCTTCAGCCAAGCC 
1750 1760 1770 1780 1790 1800 

CCACCTGCACGTGATCTGCTGGCCCTCAGGCTGCTGCTCTGCCTTCATTGCTGGAGACAG 
1810 1820 1830 1840 1850 1860 

TAGAGGCATGAACACACATGGATGCACACACACACACGCCAATGCACACACACAGAGATA 
1870 1880 1890 1900 1910 1920 

TGCACACACACGGATGCACACACAGATGGTCACACAGAGATACGCAAACACACCGATGCA 
1930 1940 1950 1960 1970 1980 

CACGCACATAGAGATATGCACACACAGATGCACACACAGAT ATACACATGGATGCACGCA 
1990 2000 2010 2020 2030 2040 

CATGCCAATGCACGCACACATCAGTGCACACGGATGCACAGAGATATGCACACACCGATG 
2050 2060 2070 2080 2090 2100 

TGCGCACACACAGATATGCACACACATGGATGAGCACACACACACCAAGTGCGCACACAC 
2110 2120 2130 2140 2150 2160 

ACCGATGT ACACACACAG ATGCACACACAGATGCACACACACCGATGCTGACTCCATGTG 
2170 2180 2190 2200 2210 2220 

TGCTGTCCTCTGAAGGCGGTTGTTTAGCTCTCACTTTTCTGGTTCTTATCCATTATCATC 
2230 2240 2250 2260 2270 2280 

TTCACTTCAGACAATTCAGAAGCATCACCATGCATGGTGGCGAATGCCCCCAAACTCTCC 
2290 2300 2310 2320 2330 2340 

CCCAAATGT ATTTCTCCCTTCGCTGGGTGCCGGGCTGCACAGACTATTCCCCACCTGCTT 
2350 2360 2370 2380 2390 2400 



CCCAGCTTCACAATAAACGGCTGCGTCTCCTCCGCACACCTGTGGTGCCTGCCACCCAAA 
2410 2420 2430 2240 2450 2460 

AAAAAAAAAAAAAAAAAA 
2470 2480 
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FIG. 3 
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FIG. 6 




FIG.8 



FIG. 7 

GGATCC ATG CAG CGC GTG AAC ATG ATC ATG GCA GAA TCA CCA GGC 
MET Gin Arg Val Asn MET lie MET Ala Glu Ser Pro Gly 

66 81 

CTC ATC ACC ATC TGC CTT TTA GGA TAT CTA CTC AGT GCT GAA TGT 

Leu lie Thr lie Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys 

96 111 126 

ACA GTT TTT CTT GAT CAT GAA AAC GCC AAC AAA ATT CTG AAT CGG 

Thr Val Phe Leu Asp His Glu Asn Ala Asn Lys lie Leu Asn Arg 

141 156 171 

CCA AAG AGG TAT AAT TCA GGT AAA TTG GAA GAG TTT GTT CAA GGG 

Pro Lys Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gin Gly 

186 201 216 

AAC CTT GAG AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA 

Asn Leu Glu Arg Glu Cys MET Glu Glu Lys Cys Ser Phe Glu Glu 

231 246 261 

GCA CGA GAA GTT TTT GAA AAC ACT GAA AGA ACA AAG CTG TTC TGG 

Ala Arg Glu Val Phe Glu Asn Thr Glu Arg Thr Lys Leu Phe Trp 

276 291 306 

ATT TCT TAC AGT GAT GGG GAC CAG TGT GCC TCA AGT CCA TGC CAG 

lie Ser Tyr Ser Asp Gly Asp Gin Cys Ala Ser Ser Pro Cys Gin 

321 336 351 

AAT GGG GGC TCC TGC AAG GAC CAG CTC CAG TCC TAT ATC TGC TTC 

Asn Gly Gly Ser Cys Lys Asp Gin Leu Gin Ser Tyr lie Cys Phe 

366 381 396 

TGC CTC CCT GCC TTC GAG GGC CGG AAC TGT GAG ACG CAC AAG GAT 

Cys Leu Pro Ala Phe Glu Gly Arg Asn Cys Glu Thr His Lys Asp 

411 426 441 

GAC CAG CTG ATC TGT GTG AAC GAG AAC GGC GGC TGT GAG CAG TAC 

Asp Glu Leu lie Cys Val Asn Glu Asn Gly Gly Cys Glu Gin Tyr 

456 471 486 

TGC AGT GAC CAC ACG GGC ACC AAG CGC TCC TGT CGG TGC CAC GAG 

Cys Ser Asp His Thr Gly Thr Lys Arg Ser Cys Arg Cys His Glu 

501 516 531 

GGG TAC TCT CTG CTG GCA GAC GGG GTG TCC TGC ACA CCC ACA GTT 

Gly Tyr Ser Leu Leu Ala Asp Gly Val Ser Cys Thr Pro Thr Val 

546 561 576 

GAA TAT CCA TCT GGA AAA ATA CCT ATT CTA GAA AAA AGA AAT GCC 

Glu Tyr Pro Cys Gly Lys lie Pro lie Leu Glu Lys Arg Asn Ala 

591 606 621 

AGC AAA CCC CAA GGC CGA ATT GTG GGG GGC AAG GTG TGC CCC AAA 

Ser Lys Pro Gin Gly Arg lie Val Gly Gly Lys Val Cys Fro Lys 



636 

GGG GAG TGT CCA TGG 
Gly Glu Cys Pro Trp 

681 

TTG TGT GGG GGG ACC 
Leu Cys Gly Gly Thr 

726 

GCC CAC TGT TTC GAC 
Ala His Cys Phe Asp 

771 

GTG CTG GGC GAG CAC 
Val Leu Gly Glu His 

816 

AGC CGG CGG GTG GCG 
Ser Arg Arg Val Ala 

861 

GGC ACC ACC AAC CAC 
Gly Thr Thr Asn His 

906 

GTG GTC CTC ACT GAC 
Val Val Leu Thr Asp 

951 

ACG TTC TCT GAG AGG 
Thr Phe Ser Glu Arg 

996 

AGC GGC TGG GGC CAG 
Ser Gly Trp Gly Gin 

1041 

CTC ATG GTC CTC AAC 
Leu MET Val Leu Asn 

1086 

CAG CAG TCA CGG AAG 
Gin Gin Ser Arg Lys 

1131 

ATG TTC TGT GCC GGC 
MET Phe Cys Ala Gly 

1176 

GGG GAC AGT GGA GGC 
Gly Asp Ser Gly Gly 

1221 

TAC CTG ACG GGC ATC 
Tyr Leu Thr gly lie 



651 

CAG GTC CTG TTG TTG 
Gin Val Leu Leu Leu 

696 

CTG ATC AAC ACC ATC 
Leu lie Asn Thr lie 

741 

AAA ATC AAG AAC TGG 
Lys lie Lys Asn Trp 

786 

GAC CTC AGC GAG CAC 
Asp Leu Ser Glu His 

831 

CAG GTC ATC ATC CCC 
Gin Val lie lie Pro 

876 

GAC ATC GCG CTG CTC 
Asp lie Ala Leu Leu 

921 

CAT GTG GTG CCC CTC 
His Val Val Pro Leu 

966 

ACG CTG GCC TTC GTG 
Thr Leu Ala Phe Val 

1011 

CTG CTG GAC CGT GGC 
Leu Leu Asp Arg Gly 

1056 

GTG CCC CGG CTG ATG 
Val Pro Arg Leu MET 

1101 

GTG GGA GAC TCC CCA 
Val Gly Asp Ser Pro 

1146 

TAC TCG GAT GGC AGC 
Tyr Ser Asp Gly Ser 

1191 

CCA CAT GCC ACC CAC 
Pro His Ala Thr his 

1236 

GTC AGC TGG GGC CAG 
Val Ser Trp Gly Gin 



666 

GTG AAT GGA GCT CAG 
Val Asn Gly Ala Gin 

711 

TGG GTG GTC TCC GCG 
Trp Val Val Ser Ala 

756 

AGG AAC CTG ATC GCG 
Arg Asn Leu lie Ala 

801 

GAC GGG GAT GAG CAG 
Asp Gly Asp Glu Gin 

846 

AGC ACG TAC GTC CCG 
Ser Thr Tyr Val Pro 

891 

CGC CTG CAC CAG CCC 
Arg Leu His Gin Pro 

936 

TGC CTG CCC GAA CGG 
Cys Leu Pro Glu Arg 

981 

CGC TTC TCA TTG GTC 
Arg Phe Ser Leu Val 

1026 

GCC ACG GCC CTG GAG 
Ala Thr Ala Leu Glu 

1071 

ACC CAG GAC TGC CTG 
Thr Gin Asp Cys Leu 

1116 

AAT ATC ACG GAG TAC 
Asn lie Thr Glu Tyr 

1161 

AAG GAC TCC TGC AAG 
Lys Asp Ser Cys Lys 

1206 

TAC CGG GGC ACG TGG 
Tyr Arg Gly Thr Trp 

1251 

GGC TGC GCA ACC GTG 
Gly Cys Ala Thr Val 



1266 1281 1296 

GGC CAC TTT GGG GTG TAC ACC AGG GTC TCC CAG TAC ATC GAG TGG 
Gly his Phe Gly Val Tyr Thr Arg Va 1 Ser Gin Tyr lie Glu Trp 

1311 1326 1341 

CTG CAA AAG CTC ATG CGC TCA GAG CCA CGC CCA GGA GTC CTC CTG 
Leu Gin Lys Leu MET Arg Ser Glu Pro Arg Pro Gly Val Leu Leu 

1356 1378 1388 1398 

CGA GCC CCA TTT CCC TAG CCCAGCAGCC CTGGCCTGTG GAGAGAAAGC 
Arg Ala Pro Phe Pro 

1408 1418 1428 1438 1448 

CAAGGCTGCG TCGA ACTGTC CTGGCACCAA ATCCCATATA TTCTTCTGCA 

1458 1468 1478 1488 1498 

GTTAATGGGG TAGAGGAGGG CATGGGAGGG AGGGAGAGGT GGGGAGGGAG 

1508 1518 1528 1538 1548 

A CA GAGA CAG AAACAGAGAG AGACAGAGAC AGAGAGAGAC TGAGGGAGAG 

1558 1568 1578 1588 1598 

ACTCTGAGGA CCATGGAGAG AGA CT CAA AG AGACTCCAAG ATTCAAAGAG 

1608 1618 1628 1638 1648 

ACTAATAGAG ACACAGAGAT GGAATAGAAA AGATGAGAGG CAGAGGCAGA 

1658 1668 1678 1688 1698 

CAGGCGCTGG ACAGAGGGGC AGGGGAGTGC CAAGGTTGTC CTGGAGGCAG 

1708 1718 1728 1738 1748 

ACAGCCCAGC TGAGCCTCCT TACCTCCCTT CAGCCAAGCC CCACCTGCAC 

1758 1768 1778 1788 1798 

GTGATCTGCT GGCCCTCAGG CTGCTGCTCT GCCTTCATTG CTGGAGACAG 

1808 1818 1828 1838 1848 

TAGAGGCATG ACACACATGG ATGCACACAC ACACACGCCA TGCACACACA 

1858 1868 1878 1888 1898 

CAGAGATATG CACACACACG GATGCACACA CAGATGGTCA CACAGAGTAC 



1908 1918 1928 1938 1948 

GCAAACACAC CGA T GCACAC GCACATA GA G ATATGCACAC A C AGATGCAC 



1958 1968 1978 1988 " 1998 

ACACAGATAT ACACATGGAG TGCACGCACA TGCCAATGCA CGCACACATC 



2008 2018 2028 2038 2048 

AGTGCACACG GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA 

2058 2068 2078 2088 2098 

GATATGCACA CACATGGATG AGCACACACA CACCAAGTGC GCACACACAC 

2108 2118 2128 2138 2148 

CGATGTACAC ACAGATGCAC ACACAGATGC ACACACACCG ATGCTGACTC 

2158 2168 2178 2188 2198 

CATGTGTGCT GTCCTCTGAA GGCGGTTGTT TAGCTCTCAC TTTTCTGGTT 

2208 2218 2228 2238 2248 

CTTATCCATT ATCATCTTCA CTTCAGACAA TTCAGAAGCA TCACCATGCA 

2258 2268 2278 2288 2298 

TGGTGGCGAA TGCCCCCAAA CTCTCCCCCA AATGTATTTC TCCCTTCGCT 

2308 2318 2328 2338 2348 

GGGTGCCGGG CTGCACAGAC TATTCCCCAC CTGCTTCCCA GCTTCACAAT 



2358 2368 2378 2388 2398 

AAACGGCTGC GTCTCCTCGC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



2408 2418 2428 2438 

AAAAAAAAAA AAGGAATTCG AGCTCGGTAC CCGGGGATCC 
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